RLMAgent module

`RecursiveLanguageModelAgent`

Bases: FunctionCallingAgent

A recursive-language-model agent.

Each turn the LM calls a single tool, run_python_code(code: str), which runs the snippet in a persistent REPL sandbox (by default a MirageSandbox) and returns {"stdout", "stderr", "error"}. State (variables, imports, function definitions) accumulates across turns so the agent can build up intermediate values, probe data, and iterate. submit, the recursive helpers and any user tools are not exposed to the LM as tools — they live inside the sandbox as plain synchronous functions (advertised through the tools catalog), reachable only from the code passed to run_python_code.

When recursive=True (the default), two extra helpers are exposed inside the sandbox: llm_query(prompt) and llm_query_batched(prompts). The agent then treats long inputs as an external environment, it writes Python that slices, filters, and aggregates the data, and recursively delegates semantic work to a sub-LM only on the snippets it cares about. Compared to feeding a long document straight into the primary LM, this trades a single huge context for many small ones, which both fits inside provider limits and reduces the chance of long-context regressions.

When recursive=False, the agent runs without the sub-LM helpers, useful when the task is purely computational and recursion would only add cost.

Bound user tools (if any) appear inside the sandbox as global functions; scripts call them directly, result = tool_name(...).

Termination: the snippet calls the in-sandbox submit(result=...) callable with the final payload. If max_iterations is reached without submit, a final inference step formats the accumulated trajectory into the target schema / data_model. Empty snippets are not termination signals, the loop feeds back a reminder and keeps going.

The llm_query quota is per-call: every invocation of this agent gets a fresh budget of max_llm_calls sub-LM queries, and concurrent invocations of the same agent instance each get an independent budget — the counter and lock are built inside call() and never shared across runs.

Example:

import synalinks
import asyncio

class Doc(synalinks.DataModel):
    text: str

class Answer(synalinks.DataModel):
    answer: str

async def main():
    primary = synalinks.LanguageModel(model="openai/gpt-4o")
    cheap = synalinks.LanguageModel(model="openai/gpt-4o-mini")
    inputs = synalinks.Input(data_model=Doc)
    outputs = await synalinks.RLM(
        data_model=Answer,
        language_model=primary,
        sub_language_model=cheap,
        max_iterations=8,
        max_llm_calls=20,
    )(inputs)
    agent = synalinks.Program(inputs=inputs, outputs=outputs)
    long_text = open("book.txt").read()
    result = await agent(Doc(text=long_text))
    print(result.prettify_json())

if __name__ == "__main__":
    asyncio.run(main())

References

Recursive Language Models

Parameters:

Name	Type	Description	Default
`schema`	`dict`	Optional. The target JSON schema for the final structured answer. If not provided, use `data_model` to infer it. When both are omitted, the agent runs in schemaless mode, the final generator emits a `ChatMessage` that is appended to the trajectory, and `call` returns the `ChatMessages` trajectory directly.	`None`
`data_model`	`DataModel \| SymbolicDataModel \| JsonDataModel`	Optional. The target data model for the final answer.	`None`
`language_model`	`LanguageModel`	The language model driving the per-turn code generator and the final-formatting step.	`None`
`sub_language_model`	`LanguageModel`	Optional. The language model used by `llm_query` and `llm_query_batched` when `recursive=True`, and by spawned subagents. Defaults to `language_model`, pass a cheaper / smaller model here when the recursive sub-queries don't need the primary LM's full capability. Ignored when `recursive=False`.	`None`
`prompt_template`	`str`	Optional. Prompt template forwarded to the per-turn code generator.	`None`
`examples`	`list`	Optional. Examples forwarded to the per-turn code generator.	`None`
`instructions`	`str`	Optional. Instructions for the per-turn code generator. Defaults to either `get_recursive_instructions` (when `recursive=True`, with the `{max_llm_calls}` placeholder substituted) or `get_default_instructions` otherwise.	`None`
`final_instructions`	`str`	Optional. Instructions for the final answer generator. Defaults to `instructions`.	`None`
`temperature`	`float`	Optional. Sampling temperature (Default 0.0).	`None`
`max_tokens`	`int`	Optional. Maximum number of tokens to generate. Default None (the model's own default; caps generation length).	`None`
`top_p`	`float`	Optional. Nucleus sampling probability. Default None (the model's own default).	`None`
`top_k`	`int`	Optional. Top-k sampling cutoff. Default None (the model's own default).	`None`
`use_inputs_schema`	`bool`	Optional. Feed the input schema to the generator prompt (Default False).	`False`
`use_outputs_schema`	`bool`	Optional. Feed the output schema to the generator prompt (Default False).	`False`
`reasoning_effort`	`str`	Optional. One of `'minimal'`, `'low'`, `'medium'`, `'high'`, `'disable'`, `'none'`, `None`. Default `None`.	`None`
`use_chain_of_thought`	`bool`	Optional. Wrap the per-turn generator in ChainOfThought so it emits a `thinking` field alongside the tool call. Default `False`.	`False`
`tools`	`list`	Optional. Extra `Tool` instances exposed to the sandbox in addition to `submit` (and `llm_query` / `llm_query_batched` when `recursive=True`). The names `submit`, `llm_query`, and `llm_query_batched` are always reserved at construction time, even when `recursive=False`, so tool naming stays stable across the two modes. Naming gotcha: each tool is registered under `tool.name == tool._func.__name__`. `Tool(_my_helper)` shows up inside the script as `_my_helper`. Rename the function rather than relying on an alias.	`None`
`autonomous`	`bool`	Optional. If `True` (default), run the full code/execute/observe loop until the LM calls `submit` or `max_iterations` is reached, then produce a structured final answer. If `False`, require a `ChatMessages` input and execute a single code turn per call, returning the updated trajectory, suitable for human-in-the-loop use. For cross-call REPL state in interactive mode, hand a `Sandbox` to `call` via the `sandbox` kwarg; the agent itself stays stateless.	`True`
`return_inputs_with_trajectory`	`bool`	Optional. Whether to return the full trajectory alongside the final answer (Default `True`).	`True`
`max_iterations`	`int`	Maximum number of code-execution turns before forcing the final answer step (Default 20).	`20`
`timeout`	`int`	Per-turn execution budget in seconds (Default 60). Recursive sub-LM calls dominate per-turn wall time; `llm_query_batched` of even a handful of prompts can take several seconds. Snippets that exceed the budget turn into an observation so the LM can recover on the next turn.	`60`
`recursive`	`bool`	Optional. If `True` (default), expose `llm_query` and `llm_query_batched` inside the sandbox and use the recursive instructions. If `False`, run without the sub-LM helpers.	`True`
`max_llm_calls`	`int`	Hard cap on sub-LM calls per agent invocation, shared between `llm_query` and `llm_query_batched` (Default 50). Once the budget is spent, further calls return an error string instead of a response so the LM can fall back to code-side aggregation. Ignored when `recursive=False`.	`50`
`max_output_chars`	`int`	Maximum characters to include from REPL output in the per-turn observation (Default 10_000). Anything beyond is truncated with a `… (truncated, N chars omitted)` marker so a single noisy turn cannot blow up the trajectory.	`10000`
`workdir`	`str`	Optional. Host directory the agent operates on. When building its own sandbox (i.e. no `sandbox` instance is supplied), the workdir seeds the sandbox filesystem. If it contains an `AGENTS.md` file, its contents are also injected as an additional input so the agent follows the declared project conventions (see `read_agents_md`). Must point to an existing directory. Defaults to `None`.	`None`
`skills`	`list`	Optional. Folder paths (Agent Skill roots) whose skills are listed for the agent as an `<available_skills>` context message (see `FunctionCallingAgent`). The skill files must also be reachable from the agent's sandbox (e.g. under `workdir`) for their bodies to be read on demand. Defaults to `None`.	`None`
`sandbox`	`Sandbox`	Optional. A pre-built `Sandbox` instance to reuse across calls. When supplied, the agent will not build its own sandbox at `call()` time and `sandbox_type` is derived from `type(sandbox)`. Pass this when the caller owns the sandbox lifecycle (e.g. interactive sessions where REPL state must persist across calls). When omitted, a fresh sandbox of `sandbox_type` is built per call.	`None`
`sandbox_type`	`type`	Optional. The `Sandbox` subclass to instantiate when no sandbox is supplied (here or to `call()`). Defaults to `MirageSandbox`, or to `type(sandbox)` when `sandbox` is given. Any `Sandbox` subclass whose `__init__` accepts `(timeout=..., name=...)` works; register custom subclasses with `@register_synalinks_serializable` so they round-trip through `get_config` / `from_config`.	`None`
`max_subagent_depth`	`int`	When `> 0`, the agent gains `spawn_subagents` / `merge_subagent` / `discard_subagent` tools (called between snippets, with the REPL idle, so they can fork it). Each subagent runs in parallel on a `Sandbox.fork` that inherits this agent's current REPL state (variables, functions, imports) and files; its work only lands on an explicit `merge_subagent`. `1` (recommended) lets this agent spawn subagents that cannot themselves spawn; higher values allow nesting. Defaults to `0` (disabled). Across parallel subagents you can fold back all their file changes, but only one subagent's REPL namespace per `spawn_subagents` batch (via `merge_subagent(..., adopt_repl= True)`): the REPL serializes only as a whole, so parallel namespaces can't be unioned. That is a backend constraint, not a design shortcut.	`0`
`name`	`str`	Optional. The name of the module.	`None`
`description`	`str`	Optional. The description of the module.	`None`